Incorporating social anchors for ad hoc retrieval
Abstract
Anchor text has been widely used in web search as an effective complement to web page content. This motivates the investigation of similar sources of evidence about relevance. Social media postings often contain links to associated web pages, although typically not with anchor text. In this paper, we explore the use of these links and the text in the social postings as a form of anchor text (social anchors) for improving ad hoc search. Using a test collection based on ClueWeb09 together with associated social media, we show that by incorporating social anchor features, search effectiveness for “ad hoc” tasks can be significantly improved compared to state-of-the-art approaches. We also investigate the relative importance of social anchor features for retrieval, and show that query-dependent features are usually the key to better search performance.
Similar resources
BBN at TREC 7: Using Hidden Markov
We present a new method for information retrieval using hidden Markov models (HMMs) and relate our experience with this system on the TREC-7 ad hoc task. We develop a general framework for incorporating multiple word generation mechanisms within the same model. We then demonstrate that an extremely simple realization of this model substantially outperforms tf.idf ranking on both the TREC-6 an...
Retrieving Web Pages Using Content, Links, URLs and Anchors
For this year’s web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad hoc task and the entry page finding task, we used an information retrieval system based on a simple unigram language model. In the ad hoc task we experimented with alternative approaches to smoothing. For the entry page task, we incorporated additional information into the model...
An Evaluation and Analysis of Incorporating Term Dependency for Ad-Hoc Retrieval
Although many retrieval models incorporating term dependency have been developed, it is still unclear whether term dependency information can consistently enhance retrieval performance for different queries. We present a novel model that captures the main components of a topic and the relationship between those components and the power of term dependency to improve retrieval performance. Experi...
BBN at TREC7: Using Hidden Markov Models for Information Retrieval
We present a new method for information retrieval using hidden Markov models (HMMs) and relate our experience with this system on the TREC-7 ad hoc task. We develop a general framework for incorporating multiple word generation mechanisms within the same model. We then demonstrate that an extremely simple realization of this model substantially outperforms tf.idf ranking on both the TREC-6 and...
The University of Amsterdam at INEX 2007
In this paper, we document our efforts at INEX 2007 where we participated in the Ad Hoc Track, the Link the Wiki Track, and the Interactive Track that continued from INEX 2006. Our main aims at INEX 2007 were the following. For the Ad Hoc Track, we investigated the effectiveness of incorporating link evidence into the model, and of a CAS filtering method exploiting the structural hints in the I...